Prefix Path Streaming: a New Clustering Method for XML Twig Pattern Matching

نویسندگان

  • Ting Chen
  • Tok Wang Ling
  • Chee-Yong Chan
چکیده

Searching for all occurrences of a twig pattern in a XML document is an important operation in XML query processing. Recently a class of holistic twig pattern matching algorithms has been proposed. Compared with the prior approaches, the holistic method avoids generating large intermediate results which do not contribute to the final answer. The method is CPU and I/O optimal when twig patterns only have ancestor-descendant relationships.The holistic twig-pattern matching method proposed earlier [1] operates on element streams which cluster all XML elements with the same tag name together. In this paper we introduce a clustering method called Prefix Path Streaming (PPS) and new holistic twig pattern matching algorithms based on PPS. PPS clusters elements of XML documents according to the paths from root to the elements. This clustering approach avoids unnecessary scanning of irrelevant portion of XML documents.More importantly, we develop optimal algorithms based on PPS streaming which can process a large class of twig patterns consisting of both ancestor-descendant and parent-child relationships.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluation of Twig Pattern Queries for Streaming Xml Data Using Lineage Encoding

In this paper, we propose an energy and latency efficient XML dissemination scheme for the mobile computing. It describes a novel unit structure called G-node for streaming XML data in the wireless environment. It exploits the profit of the structure indexing and attributes summarization that may integrate relevant XML elements into a group. It provides a way for choosy access of their attribut...

متن کامل

XML Dissemination Scheme for Mobile Computing Based on Lineage Encoding

In wireless environments, broadcasting is an efficient and scalable method to broadcast information to a massive number of clients. We propose an energy and latency efficient XML dissemination scheme for the wireless mobile computing environments. This paper presents a novel unit structure called G-node for streaming XML data in the wireless system. It applies the benefits of the structure inde...

متن کامل

Dissemination of Xml Data in Wireless Environment Supporting Twig Pattern Queries

The main aim of this paper is to improve energy and latency efficiency of XML dissemination scheme for the mobile computing, which is based on Lineage Encoding, G-node and scheduling algorithm for streaming XML data in the wireless environment. In this paper we propose a new broadcasting scheduling algorithm Frequently Access First (FAF) which effectively organize XML data on wireless channels....

متن کامل

StreamTX: extracting tuples from streaming XML data

We study the problem of extracting flattened tuple data from streaming, hierarchical XML data. Tuple-extraction queries are essentially XML pattern queries with multiple extraction nodes. Their typical applications include mapping-based XML transformation and integrated (set-based) processing of XML and relational data. Holistic twig joins are known for the optimal matching of XML pattern queri...

متن کامل

An Well-Organised Wireless XML Streaming Supporting Twig Pattern Queries using Lineage Encoding

In this paper, we propose an energy and latency efficient XML dissemination scheme for the mobile computing. It describes a novel unit structure called Gnode for streaming XML data in the wireless environment. It exploits the profit of the structure indexing and attributes summarization that may integrate relevant XML elements into a group. It provides a way for choosy access of their attribute...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004